Overview

Dataset statistics

Number of variables: 18
Number of observations: 265
Missing cells: 368
Missing cells (%): 7.7%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 67.9 KiB
Average record size in memory: 262.5 B

Variable types

NUM: 16
CAT: 2

Warnings

SEM is highly correlated with MES and 2 other fields (High correlation)
MES is highly correlated with SEM and 2 other fields (High correlation)
COMP_ENV is highly correlated with CAR_PKL (High correlation)
CAR_PKL is highly correlated with COMP_ENV and 3 other fields (High correlation)
VENTA is highly correlated with CAR_PKL and 2 other fields (High correlation)
COSTO is highly correlated with CAR_PKL and 2 other fields (High correlation)
UTIL is highly correlated with CAR_PKL and 2 other fields (High correlation)
MES_CFD is highly correlated with MES and 2 other fields (High correlation)
SEM_CFD is highly correlated with MES and 2 other fields (High correlation)
CAR_PAQ has 3 (1.1%) missing values (Missing)
COMP_PAQ has 181 (68.3%) missing values (Missing)
LACT_Q has 184 (69.4%) missing values (Missing)
FECHA has unique values (Unique)
F_CFD has unique values (Unique)
BEB_REF has 6 (2.3%) zeros (Zeros)
DESC has 152 (57.4%) zeros (Zeros)
C_CFD has 88 (33.2%) zeros (Zeros)
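
The overview figures can be cross-checked against the Missing warnings above. The sketch below uses only numbers quoted in this report: it recomputes the missing-cell percentage as missing cells divided by (variables × observations) and confirms that the three Missing warnings account for every missing cell. The variable names are illustrative and are not part of pandas-profiling.

```python
# Figures copied from "Dataset statistics" and the Missing warnings above.
n_variables = 18
n_observations = 265
missing_cells = 368
missing_by_variable = {"CAR_PAQ": 3, "COMP_PAQ": 181, "LACT_Q": 184}

total_cells = n_variables * n_observations        # 18 * 265 = 4770
missing_pct = 100 * missing_cells / total_cells   # share of empty cells

# The per-variable counts should add up to the dataset-level total.
assert sum(missing_by_variable.values()) == missing_cells

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")       # prints "missing cells: 7.7%"
for name, count in missing_by_variable.items():
    # Per-variable percentages are relative to the number of observations.
    print(f"{name}: {100 * count / n_observations:.1f}% missing")
```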

Reproduction

Analysis started: 2020-12-16 02:53:26.095423
Analysis finished: 2020-12-16 02:55:45.428822
Duration: 2 minutes and 19.33 seconds
Software version: pandas-profiling v2.9.0
Download configuration: config.yaml

Variables

MES
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 9
Distinct (%): 3.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4.871698113
Minimum: 1
Maximum: 9
Zeros: 0
Zeros (%): 0.0%
Memory size: 2.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 3
Median: 5
Q3: 7
95-th percentile: 9
Maximum: 9
Range: 8
Interquartile range (IQR): 4

Descriptive statistics

Standard deviation: 2.519347661
Coefficient of variation (CV): 0.5171395276
Kurtosis: -1.198292884
Mean: 4.871698113
Median Absolute Deviation (MAD): 2
Skewness: 0.01420693516
Sum: 1291
Variance: 6.347112636
Monotonicity: Increasing
Histogram with fixed size bins (bins=9)

Value  Count  Frequency (%)
8      31     11.7%
7      31     11.7%
5      31     11.7%
3      31     11.7%
1      31     11.7%
6      30     11.3%
4      30     11.3%
2      29     10.9%
9      21     7.9%

Value  Count  Frequency (%)
1      31     11.7%
2      29     10.9%
3      31     11.7%
4      30     11.3%
5      31     11.7%
6      30     11.3%
7      31     11.7%
8      31     11.7%
9      21     7.9%

Value  Count  Frequency (%)
9      21     7.9%
8      31     11.7%
7      31     11.7%
6      30     11.3%
5      31     11.7%
4      30     11.3%
3      31     11.7%
2      29     10.9%
1      31     11.7%
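
Because MES has only nine distinct values, its summary statistics can be rebuilt from the frequency table alone. The following is a minimal sketch using Python's standard `statistics` module, not the code pandas-profiling runs. It assumes the reported variance is the sample variance (n − 1 denominator) and that quartiles use linear interpolation, which `method="inclusive"` provides; both assumptions match the reported figures for this variable.

```python
import statistics

# Value -> count, copied from the MES frequency table above.
mes_counts = {1: 31, 2: 29, 3: 31, 4: 30, 5: 31, 6: 30, 7: 31, 8: 31, 9: 21}

# Expand the table back into one value per observation, in sorted order.
values = [v for v, n in sorted(mes_counts.items()) for _ in range(n)]

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance (n - 1 denominator)
std_dev = statistics.stdev(values)
cv = std_dev / mean                      # coefficient of variation
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")

print(len(values), total)                # 265 1291
print(round(mean, 6), round(variance, 6), round(std_dev, 6), round(cv, 6))
print(q1, median, q3, q3 - q1)           # quartiles and interquartile range
```

The same reconstruction works for any variable whose frequency table lists every distinct value (COMP_PAQ and LACT_Q, after excluding missing values), but not for tables that end in an "Other values" row.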
 

SEM
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 39
Distinct (%): 14.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 19.71320755
Minimum: 1
Maximum: 39
Zeros: 0
Zeros (%): 0.0%
Memory size: 2.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 3
Q1: 10
Median: 20
Q3: 29
95-th percentile: 37
Maximum: 39
Range: 38
Interquartile range (IQR): 19

Descriptive statistics

Standard deviation: 10.95275733
Coefficient of variation (CV): 0.5556050329
Kurtosis: -1.198329576
Mean: 19.71320755
Median Absolute Deviation (MAD): 9
Skewness: -0.0005269962524
Sum: 5224
Variance: 119.9628931
Monotonicity: Increasing
Histogram with fixed size bins (bins=39)

Value               Count  Frequency (%)
20                  7      2.6%
38                  7      2.6%
18                  7      2.6%
17                  7      2.6%
16                  7      2.6%
15                  7      2.6%
14                  7      2.6%
13                  7      2.6%
12                  7      2.6%
11                  7      2.6%
10                  7      2.6%
9                   7      2.6%
8                   7      2.6%
7                   7      2.6%
6                   7      2.6%
5                   7      2.6%
4                   7      2.6%
19                  7      2.6%
21                  7      2.6%
2                   7      2.6%
22                  7      2.6%
37                  7      2.6%
36                  7      2.6%
35                  7      2.6%
34                  7      2.6%
Other values (14)   90     34.0%

Value               Count  Frequency (%)
1                   5      1.9%
2                   7      2.6%
3                   7      2.6%
4                   7      2.6%
5                   7      2.6%
6                   7      2.6%
7                   7      2.6%
8                   7      2.6%
9                   7      2.6%
10                  7      2.6%

Value               Count  Frequency (%)
39                  1      0.4%
38                  7      2.6%
37                  7      2.6%
36                  7      2.6%
35                  7      2.6%
34                  7      2.6%
33                  7      2.6%
32                  7      2.6%
31                  7      2.6%
30                  7      2.6%

FECHA
Categorical

UNIQUE

Distinct: 265
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.2 KiB
Value               Count  Frequency (%)
06/06/2020          1      0.4%
21/09/2020          1      0.4%
12/09/2020          1      0.4%
10/05/2020          1      0.4%
10/09/2020          1      0.4%
17/02/2020          1      0.4%
30/04/2020          1      0.4%
02/06/2020          1      0.4%
25/08/2020          1      0.4%
16/03/2020          1      0.4%
18/05/2020          1      0.4%
05/02/2020          1      0.4%
02/02/2020          1      0.4%
28/04/2020          1      0.4%
15/03/2020          1      0.4%
02/04/2020          1      0.4%
29/03/2020          1      0.4%
18/01/2020          1      0.4%
11/01/2020          1      0.4%
25/07/2020          1      0.4%
28/01/2020          1      0.4%
05/09/2020          1      0.4%
22/02/2020          1      0.4%
06/02/2020          1      0.4%
07/07/2020          1      0.4%
Other values (240)  240    90.6%

Frequencies of value counts

Unique

Unique: 265
Unique (%): 100.0%
Histogram of lengths of the category

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Overview of Unicode Properties

Unique unicode characters: 11
Unique unicode categories: 2
Unique unicode scripts: 1
Unique unicode blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

Value  Count  Frequency (%)
0      901    34.0%
2      667    25.2%
/      530    20.0%
1      153    5.8%
3      69     2.6%
5      57     2.2%
7      57     2.2%
8      57     2.2%
4      56     2.1%
6      56     2.1%
9      47     1.8%

Most occurring categories

Value              Count  Frequency (%)
Decimal Number     2120   80.0%
Other Punctuation  530    20.0%

Most frequent Decimal Number characters

Value  Count  Frequency (%)
0      901    42.5%
2      667    31.5%
1      153    7.2%
3      69     3.3%
5      57     2.7%
7      57     2.7%
8      57     2.7%
4      56     2.6%
6      56     2.6%
9      47     2.2%

Most frequent Other Punctuation characters

Value  Count  Frequency (%)
/      530    100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  2650   100.0%

Most frequent Common characters

Value  Count  Frequency (%)
0      901    34.0%
2      667    25.2%
/      530    20.0%
1      153    5.8%
3      69     2.6%
5      57     2.2%
7      57     2.2%
8      57     2.2%
4      56     2.1%
6      56     2.1%
9      47     1.8%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  2650   100.0%

Most frequent ASCII characters

Value  Count  Frequency (%)
0      901    34.0%
2      667    25.2%
/      530    20.0%
1      153    5.8%
3      69     2.6%
5      57     2.2%
7      57     2.2%
8      57     2.2%
4      56     2.1%
6      56     2.1%
9      47     1.8%
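
FECHA is profiled as Categorical because its dates are stored as dd/mm/yyyy strings: every value has length 10 and uses only digits and "/". A minimal sketch, assuming that format holds for all 265 values, parses them with the standard library and tests a guess that MES is the calendar month and SEM the ISO week number. That guess is an assumption, checked here only against rows quoted in the Sample section at the end of this report; `parse_fecha` is a hypothetical helper name.

```python
from datetime import datetime

def parse_fecha(text):
    """Parse a dd/mm/yyyy string such as '21/09/2020' into a date."""
    return datetime.strptime(text, "%d/%m/%Y").date()

# (FECHA, MES, SEM) triples copied from the first and last sample rows.
sample = [
    ("01/01/2020", 1, 1),
    ("05/01/2020", 1, 1),
    ("06/01/2020", 1, 2),
    ("13/09/2020", 9, 37),
    ("14/09/2020", 9, 38),
    ("21/09/2020", 9, 39),
]

for text, mes, sem in sample:
    day = parse_fecha(text)
    iso_week = day.isocalendar()[1]
    # Compare the derived month and ISO week with the recorded MES and SEM.
    print(text, day.isoformat(), day.month == mes, iso_week == sem)

# One row per calendar day over this span gives the reported 265 observations.
span_days = (parse_fecha("21/09/2020") - parse_fecha("01/01/2020")).days + 1
print(span_days)   # 265
```

If the guess holds for the full dataset, it would explain why MES, SEM, MES_CFD and SEM_CFD are all flagged as highly correlated: each would be a function of the same date.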
 

BEB_REF
Real number (ℝ≥0)

ZEROS

Distinct: 34
Distinct (%): 12.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 9.288679245
Minimum: 0
Maximum: 35
Zeros: 6
Zeros (%): 2.3%
Memory size: 2.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 2
Q1: 4
Median: 7
Q3: 12
95-th percentile: 23
Maximum: 35
Range: 35
Interquartile range (IQR): 8

Descriptive statistics

Standard deviation: 7.093575315
Coefficient of variation (CV): 0.76367965
Kurtosis: 1.211701267
Mean: 9.288679245
Median Absolute Deviation (MAD): 3
Skewness: 1.320725831
Sum: 2461.5
Variance: 50.31881075
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=34)

Value               Count  Frequency (%)
5                   34     12.8%
4                   27     10.2%
6                   24     9.1%
7                   20     7.5%
3                   19     7.2%
8                   14     5.3%
9                   14     5.3%
11                  11     4.2%
10                  11     4.2%
2                   10     3.8%
12                  8      3.0%
21                  7      2.6%
0                   6      2.3%
16                  6      2.3%
22                  6      2.3%
1                   5      1.9%
23                  5      1.9%
15                  5      1.9%
13                  5      1.9%
19                  4      1.5%
18                  4      1.5%
20                  3      1.1%
14                  3      1.1%
27                  2      0.8%
28                  2      0.8%
Other values (9)    10     3.8%

Value               Count  Frequency (%)
0                   6      2.3%
1                   5      1.9%
2                   10     3.8%
3                   19     7.2%
4                   27     10.2%
5                   34     12.8%
6                   24     9.1%
7                   20     7.5%
7.5                 1      0.4%
8                   14     5.3%

Value               Count  Frequency (%)
35                  1      0.4%
32                  1      0.4%
31                  1      0.4%
30                  2      0.8%
29                  1      0.4%
28                  2      0.8%
27                  2      0.8%
26                  1      0.4%
25                  1      0.4%
23                  5      1.9%

CAR_PAQ
Real number (ℝ≥0)

MISSING

Distinct: 47
Distinct (%): 17.9%
Missing: 3
Missing (%): 1.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 11.86641221
Minimum: 0
Maximum: 99
Zeros: 2
Zeros (%): 0.8%
Memory size: 2.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 1
Q1: 4
Median: 7
Q3: 14
95-th percentile: 36.9
Maximum: 99
Range: 99
Interquartile range (IQR): 10

Descriptive statistics

Standard deviation: 13.24450648
Coefficient of variation (CV): 1.116134029
Kurtosis: 9.32851611
Mean: 11.86641221
Median Absolute Deviation (MAD): 4
Skewness: 2.565280147
Sum: 3109
Variance: 175.4169518
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=47)

Value               Count  Frequency (%)
4                   23     8.7%
5                   20     7.5%
1                   20     7.5%
8                   19     7.2%
2                   19     7.2%
3                   19     7.2%
7                   15     5.7%
6                   14     5.3%
10                  12     4.5%
12                  10     3.8%
11                  9      3.4%
9                   8      3.0%
18                  5      1.9%
14                  5      1.9%
15                  5      1.9%
34                  4      1.5%
16                  4      1.5%
26                  3      1.1%
43                  3      1.1%
25                  3      1.1%
19                  3      1.1%
35                  3      1.1%
21                  3      1.1%
24                  2      0.8%
32                  2      0.8%
Other values (22)   29     10.9%
(Missing)           3      1.1%

Value               Count  Frequency (%)
0                   2      0.8%
1                   20     7.5%
2                   19     7.2%
3                   19     7.2%
4                   23     8.7%
5                   20     7.5%
6                   14     5.3%
7                   15     5.7%
8                   19     7.2%
9                   8      3.0%

Value               Count  Frequency (%)
99                  1      0.4%
70                  1      0.4%
64                  1      0.4%
54                  1      0.4%
53                  2      0.8%
48                  1      0.4%
45                  1      0.4%
43                  3      1.1%
42                  1      0.4%
38                  1      0.4%

CAR_PKL
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 263
Distinct (%): 99.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 31.80792917
Minimum: 0
Maximum: 150.47143
Zeros: 2
Zeros (%): 0.8%
Memory size: 2.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 7.378816
Q1: 14.3999
Median: 23.3879
Q3: 41.81135
95-th percentile: 80.430516
Maximum: 150.47143
Range: 150.47143
Interquartile range (IQR): 27.41145

Descriptive statistics

Standard deviation: 24.41681981
Coefficient of variation (CV): 0.7676331169
Kurtosis: 2.264236552
Mean: 31.80792917
Median Absolute Deviation (MAD): 10.65933
Skewness: 1.456451917
Sum: 8429.10123
Variance: 596.1810897
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Value               Count  Frequency (%)
0                   2      0.8%
9.75                2      0.8%
7.55                1      0.4%
25.83813            1      0.4%
13.7622             1      0.4%
38.11               1      0.4%
21.77855            1      0.4%
37.765              1      0.4%
18.33491            1      0.4%
26.3                1      0.4%
5.07                1      0.4%
69.55               1      0.4%
47.83105            1      0.4%
90.71076            1      0.4%
41.75683            1      0.4%
74.27464            1      0.4%
32.19942            1      0.4%
65.51091            1      0.4%
15.88125            1      0.4%
17.08726            1      0.4%
10.185              1      0.4%
26.12762            1      0.4%
15.5441             1      0.4%
21.83               1      0.4%
27.04841            1      0.4%
Other values (238)  238    89.8%

Value               Count  Frequency (%)
0                   2      0.8%
1.8                 1      0.4%
2                   1      0.4%
2.16279             1      0.4%
2.43                1      0.4%
2.93                1      0.4%
5.02682             1      0.4%
5.07                1      0.4%
6.80209             1      0.4%
7.12143             1      0.4%

Value               Count  Frequency (%)
150.47143           1      0.4%
108.52534           1      0.4%
107.85996           1      0.4%
105.015             1      0.4%
95.7853             1      0.4%
93.06689            1      0.4%
92.50044            1      0.4%
90.71076            1      0.4%
87.35798            1      0.4%
85.46238            1      0.4%

COMP_ENV
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 111
Distinct (%): 41.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 28.48018868
Minimum: 0
Maximum: 112
Zeros: 2
Zeros (%): 0.8%
Memory size: 2.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 7.1
Q1: 14
Median: 21
Q3: 36.5
95-th percentile: 69.8
Maximum: 112
Range: 112
Interquartile range (IQR): 22.5

Descriptive statistics

Standard deviation: 20.75398907
Coefficient of variation (CV): 0.7287166987
Kurtosis: 1.592213484
Mean: 28.48018868
Median Absolute Deviation (MAD): 9
Skewness: 1.355259784
Sum: 7547.25
Variance: 430.7280625
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Value               Count  Frequency (%)
12                  10     3.8%
20.5                8      3.0%
21.5                8      3.0%
14.5                7      2.6%
9                   7      2.6%
13                  6      2.3%
15                  6      2.3%
20                  6      2.3%
10.5                6      2.3%
17                  5      1.9%
19                  5      1.9%
10                  5      1.9%
21                  5      1.9%
22.5                4      1.5%
36.5                4      1.5%
18                  4      1.5%
30.5                4      1.5%
17.5                4      1.5%
31.5                4      1.5%
27.5                4      1.5%
18.5                4      1.5%
51                  3      1.1%
16.5                3      1.1%
15.5                3      1.1%
13.5                3      1.1%
Other values (86)   137    51.7%

Value               Count  Frequency (%)
0                   2      0.8%
2.5                 1      0.4%
3                   1      0.4%
3.5                 1      0.4%
4                   1      0.4%
5                   1      0.4%
5.5                 2      0.8%
6                   3      1.1%
6.5                 1      0.4%
7                   1      0.4%

Value               Count  Frequency (%)
112                 1      0.4%
110.5               1      0.4%
91.5                1      0.4%
91                  1      0.4%
80                  1      0.4%
79.5                1      0.4%
78.5                1      0.4%
78                  1      0.4%
77                  1      0.4%
75.5                1      0.4%

COMP_PAQ
Real number (ℝ≥0)

MISSING

Distinct: 9
Distinct (%): 10.7%
Missing: 181
Missing (%): 68.3%
Infinite: 0
Infinite (%): 0.0%
Mean: 3.238095238
Minimum: 0
Maximum: 17
Zeros: 2
Zeros (%): 0.8%
Memory size: 2.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 1
Q1: 2
Median: 2
Q3: 4
95-th percentile: 6
Maximum: 17
Range: 17
Interquartile range (IQR): 2

Descriptive statistics

Standard deviation: 2.382764657
Coefficient of variation (CV): 0.7358537913
Kurtosis: 12.23984628
Mean: 3.238095238
Median Absolute Deviation (MAD): 1
Skewness: 2.578440819
Sum: 272
Variance: 5.677567413
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=9)

Value      Count  Frequency (%)
2          32     12.1%
4          20     7.5%
6          12     4.5%
1          12     4.5%
8          2      0.8%
3          2      0.8%
0          2      0.8%
17         1      0.4%
5          1      0.4%
(Missing)  181    68.3%

Value      Count  Frequency (%)
0          2      0.8%
1          12     4.5%
2          32     12.1%
3          2      0.8%
4          20     7.5%
5          1      0.4%
6          12     4.5%
8          2      0.8%
17         1      0.4%

Value      Count  Frequency (%)
17         1      0.4%
8          2      0.8%
6          12     4.5%
5          1      0.4%
4          20     7.5%
3          2      0.8%
2          32     12.1%
1          12     4.5%
0          2      0.8%

LACT_Q
Real number (ℝ≥0)

MISSING

Distinct: 5
Distinct (%): 6.2%
Missing: 184
Missing (%): 69.4%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.345679012
Minimum: 0
Maximum: 4
Zeros: 2
Zeros (%): 0.8%
Memory size: 2.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 1
Q1: 1
Median: 1
Q3: 2
95-th percentile: 3
Maximum: 4
Range: 4
Interquartile range (IQR): 1

Descriptive statistics

Standard deviation: 0.6921071779
Coefficient of variation (CV): 0.5143181781
Kurtosis: 2.613059332
Mean: 1.345679012
Median Absolute Deviation (MAD): 0
Skewness: 1.504362408
Sum: 109
Variance: 0.4790123457
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=5)

Value      Count  Frequency (%)
1          56     21.1%
2          17     6.4%
3          5      1.9%
0          2      0.8%
4          1      0.4%
(Missing)  184    69.4%

Value      Count  Frequency (%)
0          2      0.8%
1          56     21.1%
2          17     6.4%
3          5      1.9%
4          1      0.4%

Value      Count  Frequency (%)
4          1      0.4%
3          5      1.9%
2          17     6.4%
1          56     21.1%
0          2      0.8%

TOR_TOR
Real number (ℝ≥0)

Distinct: 56
Distinct (%): 21.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 16.3309434
Minimum: 0
Maximum: 70
Zeros: 2
Zeros (%): 0.8%
Memory size: 2.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 2.2
Q1: 7
Median: 11
Q3: 23
95-th percentile: 45.6
Maximum: 70
Range: 70
Interquartile range (IQR): 16

Descriptive statistics

Standard deviation: 14.02628299
Coefficient of variation (CV): 0.8588776933
Kurtosis: 1.609754164
Mean: 16.3309434
Median Absolute Deviation (MAD): 6
Skewness: 1.44140782
Sum: 4327.7
Variance: 196.7366146
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Value               Count  Frequency (%)
8                   23     8.7%
6                   23     8.7%
9                   18     6.8%
11                  15     5.7%
12                  14     5.3%
7                   13     4.9%
4                   11     4.2%
2                   10     3.8%
3                   9      3.4%
5                   8      3.0%
14                  8      3.0%
17                  8      3.0%
10                  5      1.9%
34                  5      1.9%
13                  5      1.9%
29                  5      1.9%
18                  5      1.9%
20                  5      1.9%
23                  4      1.5%
26                  4      1.5%
15                  4      1.5%
19                  4      1.5%
28                  4      1.5%
43                  3      1.1%
38                  3      1.1%
Other values (31)   49     18.5%

Value               Count  Frequency (%)
0                   2      0.8%
1                   2      0.8%
2                   10     3.8%
3                   9      3.4%
4                   11     4.2%
5                   8      3.0%
6                   23     8.7%
7                   13     4.9%
8                   23     8.7%
9                   18     6.8%

Value               Count  Frequency (%)
70                  1      0.4%
66                  1      0.4%
64                  1      0.4%
60                  1      0.4%
54                  1      0.4%
53                  1      0.4%
52                  2      0.8%
51                  2      0.8%
50                  1      0.4%
49                  1      0.4%

VENTA
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 264
Distinct (%): 99.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 11979.41449
Minimum: 0
Maximum: 65869.64217
Zeros: 2
Zeros (%): 0.8%
Memory size: 2.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 2668.047062
Q1: 5338.96422
Median: 8355.30021
Q3: 15988.17494
95-th percentile: 30474.96296
Maximum: 65869.64217
Range: 65869.64217
Interquartile range (IQR): 10649.21072

Descriptive statistics

Standard deviation: 9868.453484
Coefficient of variation (CV): 0.8237842916
Kurtosis: 5.8368932
Mean: 11979.41449
Median Absolute Deviation (MAD): 3810.80084
Skewness: 1.979130139
Sum: 3174544.841
Variance: 97386374.16
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Value               Count  Frequency (%)
0                   2      0.8%
5837.6448           1      0.4%
2478.99862          1      0.4%
30638.32344         1      0.4%
4311.80188          1      0.4%
8003.04956          1      0.4%
4791.77854          1      0.4%
31952.78762         1      0.4%
11180.23118         1      0.4%
8609.395            1      0.4%
7805.55998          1      0.4%
17283.69027         1      0.4%
17145.22914         1      0.4%
4814.932            1      0.4%
12100.09628         1      0.4%
65869.64217         1      0.4%
25700.34            1      0.4%
6398.375            1      0.4%
21383.00639         1      0.4%
9705.2              1      0.4%
7631.685504         1      0.4%
15580.24122         1      0.4%
6588.39874          1      0.4%
13402.93052         1      0.4%
5356.31951          1      0.4%
Other values (239)  239    90.2%

Value               Count  Frequency (%)
0                   2      0.8%
953.2               1      0.4%
1328.47             1      0.4%
1485.99991          1      0.4%
2024.69             1      0.4%
2400.92999          1      0.4%
2418.53             1      0.4%
2478.99862          1      0.4%
2500.75             1      0.4%
2521.25993          1      0.4%

Value               Count  Frequency (%)
65869.64217         1      0.4%
65186.365           1      0.4%
39419.36894         1      0.4%
37653.8714          1      0.4%
37149.36856         1      0.4%
35925.08891         1      0.4%
35904.09944         1      0.4%
35543.05746         1      0.4%
34049.29234         1      0.4%
33549.63659         1      0.4%

COSTO
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 264
Distinct (%): 99.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 5977.859799
Minimum: 0
Maximum: 34543.85635
Zeros: 2
Zeros (%): 0.8%
Memory size: 2.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 1353.027384
Q1: 2662.49746
Median: 3998.3555
Q3: 7845.966177
95-th percentile: 15573.27431
Maximum: 34543.85635
Range: 34543.85635
Interquartile range (IQR): 5183.468717

Descriptive statistics

Standard deviation: 5014.677071
Coefficient of variation (CV): 0.8388749886
Kurtosis: 6.381239694
Mean: 5977.859799
Median Absolute Deviation (MAD): 1800.8645
Skewness: 2.050190953
Sum: 1584132.847
Variance: 25146986.12
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Value               Count  Frequency (%)
0                   2      0.8%
1432.0299           1      0.4%
6970.627398         1      0.4%
3949.131            1      0.4%
3729.00609          1      0.4%
3638.0742           1      0.4%
3845.25945          1      0.4%
3714.7289           1      0.4%
2702.142666         1      0.4%
6043.14649          1      0.4%
2221.776            1      0.4%
14879.18012         1      0.4%
4596.127368         1      0.4%
7362.0915           1      0.4%
8194.1005           1      0.4%
10194.41512         1      0.4%
3929.325774         1      0.4%
4793.0479           1      0.4%
17041.087           1      0.4%
7071.712962         1      0.4%
3889.597158         1      0.4%
5302.9364           1      0.4%
6724.60131          1      0.4%
4286.9402           1      0.4%
4338.12             1      0.4%
Other values (239)  239    90.2%

Value               Count  Frequency (%)
0                   2      0.8%
499                 1      0.4%
598.041             1      0.4%
872.15444           1      0.4%
910.8698            1      0.4%
1118.6334           1      0.4%
1263.252502         1      0.4%
1301.7246           1      0.4%
1310.438            1      0.4%
1318.4225           1      0.4%

Value               Count  Frequency (%)
34543.85635         1      0.4%
33142.7805          1      0.4%
18479.96385         1      0.4%
18442.60934         1      0.4%
18435.20009         1      0.4%
17588.79157         1      0.4%
17307.53838         1      0.4%
17217.59022         1      0.4%
17041.087           1      0.4%
16733.6167          1      0.4%

UTIL
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 264
Distinct (%): 99.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6009.031639
Minimum: 0
Maximum: 32065.5845
Zeros: 2
Zeros (%): 0.8%
Memory size: 2.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 1262.86526
Q1: 2561.10017
Median: 4228.165356
Q3: 7509.720343
95-th percentile: 16850.28883
Maximum: 32065.5845
Range: 32065.5845
Interquartile range (IQR): 4948.620173

Descriptive statistics

Standard deviation: 5040.687346
Coefficient of variation (CV): 0.8388518565
Kurtosis: 4.817947765
Mean: 6009.031639
Median Absolute Deviation (MAD): 1970.315506
Skewness: 1.914499273
Sum: 1592393.384
Variance: 25408528.92
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Value               Count  Frequency (%)
0                   2      0.8%
10751.37485         1      0.4%
3048.152205         1      0.4%
16934.87102         1      0.4%
5925.81642          1      0.4%
6035.7595           1      0.4%
5987.86976          1      0.4%
8274.727128         1      0.4%
2139.674            1      0.4%
14547.95809         1      0.4%
4567.2318           1      0.4%
464.2               1      0.4%
1805.21217          1      0.4%
3022.24305          1      0.4%
21041.56885         1      0.4%
13130.45675         1      0.4%
3735.5327           1      0.4%
6510.8685           1      0.4%
613.84547           1      0.4%
2807.00392          1      0.4%
4485.0595           1      0.4%
7238.23458          1      0.4%
6975.44476          1      0.4%
3370.6132           1      0.4%
2516.24259          1      0.4%
Other values (239)  239    90.2%

Value               Count  Frequency (%)
0                   2      0.8%
464.2               1      0.4%
613.84547           1      0.4%
730.429             1      0.4%
801.626             1      0.4%
824.243             1      0.4%
1068.096            1      0.4%
1100.1075           1      0.4%
1113.8202           1      0.4%
1147.38008          1      0.4%

Value               Count  Frequency (%)
32065.5845          1      0.4%
31357.78582         1      0.4%
21041.56885         1      0.4%
19205.90755         1      0.4%
18856.69728         1      0.4%
18817.55922         1      0.4%
18708.49869         1      0.4%
18596.56106         1      0.4%
18350.85604         1      0.4%
17870.97862         1      0.4%

DESC
Real number (ℝ≥0)

ZEROS

Distinct: 72
Distinct (%): 27.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 7.476925075
Minimum: 0
Maximum: 110.700023
Zeros: 152
Zeros (%): 57.4%
Memory size: 2.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 3.7
95-th percentile: 42.652
Maximum: 110.700023
Range: 110.700023
Interquartile range (IQR): 3.7

Descriptive statistics

Standard deviation: 16.530181
Coefficient of variation (CV): 2.210826086
Kurtosis: 11.34099833
Mean: 7.476925075
Median Absolute Deviation (MAD): 0
Skewness: 3.140150361
Sum: 1981.385145
Variance: 273.2468838
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Value               Count  Frequency (%)
0                   152    57.4%
3.2                 18     6.8%
1.9                 9      3.4%
1.3                 6      2.3%
3.7                 6      2.3%
2.6                 3      1.1%
3.8                 2      0.8%
5.97                2      0.8%
12.9                2      0.8%
22.1                2      0.8%
2.5                 2      0.8%
21.199              1      0.4%
18.1195             1      0.4%
110.700023          1      0.4%
5.2                 1      0.4%
40.6615             1      0.4%
2.4                 1      0.4%
29.388242           1      0.4%
26.2625             1      0.4%
7.5                 1      0.4%
5.7                 1      0.4%
45.9                1      0.4%
7.000056            1      0.4%
28.595              1      0.4%
3.5                 1      0.4%
Other values (47)   47     17.7%

Value               Count  Frequency (%)
0                   152    57.4%
1.3                 6      2.3%
1.9                 9      3.4%
2.4                 1      0.4%
2.5                 2      0.8%
2.6                 3      1.1%
2.7                 1      0.4%
3.2                 18     6.8%
3.5                 1      0.4%
3.7                 6      2.3%

Value               Count  Frequency (%)
110.700023          1      0.4%
93.6                1      0.4%
78.768              1      0.4%
75.589              1      0.4%
67.25               1      0.4%
65.4                1      0.4%
59.7                1      0.4%
52.62               1      0.4%
52.221              1      0.4%
52.2073             1      0.4%

MES_CFD
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 9
Distinct (%): 3.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4.871698113
Minimum: 1
Maximum: 9
Zeros: 0
Zeros (%): 0.0%
Memory size: 2.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 3
Median: 5
Q3: 7
95-th percentile: 9
Maximum: 9
Range: 8
Interquartile range (IQR): 4

Descriptive statistics

Standard deviation: 2.519347661
Coefficient of variation (CV): 0.5171395276
Kurtosis: -1.198292884
Mean: 4.871698113
Median Absolute Deviation (MAD): 2
Skewness: 0.01420693516
Sum: 1291
Variance: 6.347112636
Monotonicity: Increasing
Histogram with fixed size bins (bins=9)

Value  Count  Frequency (%)
8      31     11.7%
7      31     11.7%
5      31     11.7%
3      31     11.7%
1      31     11.7%
6      30     11.3%
4      30     11.3%
2      29     10.9%
9      21     7.9%

Value  Count  Frequency (%)
1      31     11.7%
2      29     10.9%
3      31     11.7%
4      30     11.3%
5      31     11.7%
6      30     11.3%
7      31     11.7%
8      31     11.7%
9      21     7.9%

Value  Count  Frequency (%)
9      21     7.9%
8      31     11.7%
7      31     11.7%
6      30     11.3%
5      31     11.7%
4      30     11.3%
3      31     11.7%
2      29     10.9%
1      31     11.7%

SEM_CFD
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 39
Distinct (%): 14.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 19.71320755
Minimum: 1
Maximum: 39
Zeros: 0
Zeros (%): 0.0%
Memory size: 2.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 3
Q1: 10
Median: 20
Q3: 29
95-th percentile: 37
Maximum: 39
Range: 38
Interquartile range (IQR): 19

Descriptive statistics

Standard deviation: 10.95275733
Coefficient of variation (CV): 0.5556050329
Kurtosis: -1.198329576
Mean: 19.71320755
Median Absolute Deviation (MAD): 9
Skewness: -0.0005269962524
Sum: 5224
Variance: 119.9628931
Monotonicity: Increasing
Histogram with fixed size bins (bins=39)

Value               Count  Frequency (%)
20                  7      2.6%
38                  7      2.6%
18                  7      2.6%
17                  7      2.6%
16                  7      2.6%
15                  7      2.6%
14                  7      2.6%
13                  7      2.6%
12                  7      2.6%
11                  7      2.6%
10                  7      2.6%
9                   7      2.6%
8                   7      2.6%
7                   7      2.6%
6                   7      2.6%
5                   7      2.6%
4                   7      2.6%
19                  7      2.6%
21                  7      2.6%
2                   7      2.6%
22                  7      2.6%
37                  7      2.6%
36                  7      2.6%
35                  7      2.6%
34                  7      2.6%
Other values (14)   90     34.0%

Value               Count  Frequency (%)
1                   5      1.9%
2                   7      2.6%
3                   7      2.6%
4                   7      2.6%
5                   7      2.6%
6                   7      2.6%
7                   7      2.6%
8                   7      2.6%
9                   7      2.6%
10                  7      2.6%

Value               Count  Frequency (%)
39                  1      0.4%
38                  7      2.6%
37                  7      2.6%
36                  7      2.6%
35                  7      2.6%
34                  7      2.6%
33                  7      2.6%
32                  7      2.6%
31                  7      2.6%
30                  7      2.6%

F_CFD
Categorical

UNIQUE

Distinct: 265
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.2 KiB
Value               Count  Frequency (%)
06/06/2020          1      0.4%
21/09/2020          1      0.4%
12/09/2020          1      0.4%
10/05/2020          1      0.4%
10/09/2020          1      0.4%
17/02/2020          1      0.4%
30/04/2020          1      0.4%
02/06/2020          1      0.4%
25/08/2020          1      0.4%
16/03/2020          1      0.4%
18/05/2020          1      0.4%
05/02/2020          1      0.4%
02/02/2020          1      0.4%
28/04/2020          1      0.4%
15/03/2020          1      0.4%
02/04/2020          1      0.4%
29/03/2020          1      0.4%
18/01/2020          1      0.4%
11/01/2020          1      0.4%
25/07/2020          1      0.4%
28/01/2020          1      0.4%
05/09/2020          1      0.4%
22/02/2020          1      0.4%
06/02/2020          1      0.4%
07/07/2020          1      0.4%
Other values (240)  240    90.6%

Frequencies of value counts

Unique

Unique: 265
Unique (%): 100.0%
Histogram of lengths of the category

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Overview of Unicode Properties

Unique unicode characters: 11
Unique unicode categories: 2
Unique unicode scripts: 1
Unique unicode blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

Value  Count  Frequency (%)
0      901    34.0%
2      667    25.2%
/      530    20.0%
1      153    5.8%
3      69     2.6%
5      57     2.2%
7      57     2.2%
8      57     2.2%
4      56     2.1%
6      56     2.1%
9      47     1.8%

Most occurring categories

Value              Count  Frequency (%)
Decimal Number     2120   80.0%
Other Punctuation  530    20.0%

Most frequent Decimal Number characters

Value  Count  Frequency (%)
0      901    42.5%
2      667    31.5%
1      153    7.2%
3      69     3.3%
5      57     2.7%
7      57     2.7%
8      57     2.7%
4      56     2.6%
6      56     2.6%
9      47     2.2%

Most frequent Other Punctuation characters

Value  Count  Frequency (%)
/      530    100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  2650   100.0%

Most frequent Common characters

Value  Count  Frequency (%)
0      901    34.0%
2      667    25.2%
/      530    20.0%
1      153    5.8%
3      69     2.6%
5      57     2.2%
7      57     2.2%
8      57     2.2%
4      56     2.1%
6      56     2.1%
9      47     1.8%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  2650   100.0%

Most frequent ASCII characters

Value  Count  Frequency (%)
0      901    34.0%
2      667    25.2%
/      530    20.0%
1      153    5.8%
3      69     2.6%
5      57     2.2%
7      57     2.2%
8      57     2.2%
4      56     2.1%
6      56     2.1%
9      47     1.8%

C_CFD
Real number (ℝ≥0)

ZEROS

Distinct: 119
Distinct (%): 44.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 58.63396226
Minimum: 0
Maximum: 325
Zeros: 88
Zeros (%): 33.2%
Memory size: 2.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 24
Q3: 101
95-th percentile: 214.2
Maximum: 325
Range: 325
Interquartile range (IQR): 101

Descriptive statistics

Standard deviation: 71.08812005
Coefficient of variation (CV): 1.212405188
Kurtosis: 0.7071538764
Mean: 58.63396226
Median Absolute Deviation (MAD): 24
Skewness: 1.173414062
Sum: 15538
Variance: 5053.520812
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Value               Count  Frequency (%)
0                   88     33.2%
1                   12     4.5%
3                   7      2.6%
2                   5      1.9%
111                 4      1.5%
21                  3      1.1%
77                  3      1.1%
88                  3      1.1%
89                  3      1.1%
67                  3      1.1%
47                  3      1.1%
127                 2      0.8%
105                 2      0.8%
56                  2      0.8%
123                 2      0.8%
62                  2      0.8%
65                  2      0.8%
66                  2      0.8%
106                 2      0.8%
104                 2      0.8%
140                 2      0.8%
6                   2      0.8%
68                  2      0.8%
97                  2      0.8%
95                  2      0.8%
Other values (94)   103    38.9%

Value               Count  Frequency (%)
0                   88     33.2%
1                   12     4.5%
2                   5      1.9%
3                   7      2.6%
4                   1      0.4%
5                   1      0.4%
6                   2      0.8%
7                   1      0.4%
8                   1      0.4%
9                   1      0.4%

Value               Count  Frequency (%)
325                 1      0.4%
292                 1      0.4%
250                 1      0.4%
246                 2      0.8%
238                 1      0.4%
231                 1      0.4%
228                 1      0.4%
224                 1      0.4%
222                 1      0.4%
221                 1      0.4%

Interactions

Correlations

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and (positive) scale of the two variables, so for a linear relationship the angle of the line to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
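
The calculation described above can be sketched in plain Python. This is an illustrative implementation, not the routine pandas-profiling calls; the function name `pearson_r` is hypothetical. The example data are the VENTA and COSTO values of rows 1 to 4 in the "First rows" sample at the end of this report.

```python
from math import sqrt

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    dx = [a - mean_x for a in x]
    dy = [b - mean_y for b in y]
    # The 1/(n-1) factors of the covariance and of the two standard
    # deviations cancel, so plain sums of products are enough.
    cov = sum(a * b for a, b in zip(dx, dy))
    spread = sqrt(sum(a * a for a in dx) * sum(b * b for b in dy))
    if spread == 0:
        raise ValueError("r is undefined when a variable is constant")
    return cov / spread

# VENTA and COSTO for rows 1-4 of "First rows" in the Sample section.
venta = [9635.64470, 4361.45000, 11180.23118, 19168.84426]
costo = [4877.056060, 2221.776000, 5302.936400, 9682.981650]

r = pearson_r(venta, costo)
# Shifting and positively rescaling one variable leaves r unchanged.
r_rescaled = pearson_r([2.0 * v + 100.0 for v in venta], costo)
print(round(r, 4), round(r_rescaled, 4))
```

Four rows are far too few to estimate a correlation; they only show the mechanics and the invariance property.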

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better than Pearson's r at catching nonlinear monotonic correlations. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
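
A minimal sketch of that recipe: rank each variable, then apply Pearson's formula to the ranks. Tied values receive the average of the positions they span, which is a common convention but an assumption here, since the text does not say how ties are ranked. The helper names are hypothetical.

```python
from math import sqrt

def average_ranks(values):
    """Rank values from 1 to n, giving tied values the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of observations tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1      # mean of the 1-based positions i+1..j+1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks

def pearson_r(x, y):
    """Pearson's r (inputs must be non-constant and of equal length)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)

def spearman_rho(x, y):
    """Spearman's rho: Pearson's r applied to the rank variables."""
    return pearson_r(average_ranks(x), average_ranks(y))

x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]           # monotonic but not linear
print(spearman_rho(x, y))         # the rank orders agree exactly
print(pearson_r(x, y) < 1.0)      # the linear measure falls short of 1
```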

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
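
The pair-counting definition can be sketched directly. The formula as stated corresponds to the variant usually called tau-a; statistical libraries often apply a tie correction instead (SciPy's `kendalltau` defaults to tau-b), so values from this sketch may differ from the report's when the data contain ties. The function name `kendall_tau_a` is hypothetical.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1       # both variables move the same way
        elif direction < 0:
            discordant += 1       # the variables move in opposite ways
        # direction == 0 is a tie: neither concordant nor discordant
    n = len(x)
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs

# 5 concordant pairs and 1 discordant pair out of 6, so tau = 4/6.
print(kendall_tau_a([1, 2, 3, 4], [1, 3, 2, 4]))
```

This loop visits every pair, so it is quadratic in the number of observations; that is fine for 265 rows but is a deliberate simplification.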

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.

Missing values

Sample

First rows

Row  MES  SEM  FECHA       BEB_REF  CAR_PAQ  CAR_PKL   COMP_ENV  COMP_PAQ  LACT_Q  TOR_TOR  VENTA        COSTO         UTIL          DESC    MES_CFD  SEM_CFD  F_CFD       C_CFD
0    1    1    01/01/2020  0.0      0.0      0.00000   0.0       0.0       0.0     0.0      0.00000      0.000000      0.000000      0.000   1        1        01/01/2020  0
1    1    1    02/01/2020  8.0      16.0     19.60570  17.0      2.0       1.0     8.0      9635.64470   4877.056060   4768.588640   0.000   1        1        02/01/2020  0
2    1    1    03/01/2020  5.0      9.0      7.55000   7.5       NaN       NaN     5.0      4361.45000   2221.776000   2139.674000   0.000   1        1        03/01/2020  0
3    1    1    04/01/2020  12.0     8.0      35.99762  31.0      NaN       1.0     20.0     11180.23118  5302.936400   5877.294780   0.000   1        1        04/01/2020  0
4    1    1    05/01/2020  16.0     17.0     53.04634  49.0      NaN       3.0     44.0     19168.84426  9682.981650   9485.862610   28.595  1        1        05/01/2020  0
5    1    2    06/01/2020  4.0      6.0      9.25500   28.5      NaN       NaN     14.0     5184.39500   2711.237275   2473.157725   0.000   1        2        06/01/2020  0
6    1    2    07/01/2020  3.0      1.0      10.25409  10.0      NaN       1.0     4.0      2776.88631   1379.147558   1397.738752   0.000   1        2        07/01/2020  0
7    1    2    08/01/2020  2.0      8.0      19.00682  15.0      NaN       NaN     9.0      7805.55998   3714.728900   4090.831080   3.870   1        2        08/01/2020  0
8    1    2    09/01/2020  6.0      10.0     12.99341  17.5      4.0       NaN     9.0      6082.47499   3036.443375   3068.031615   0.000   1        2        09/01/2020  0
9    1    2    10/01/2020  4.0      2.0      14.62201  15.0      2.0       NaN     8.0      4425.48949   2179.439640   2257.849850   0.000   1        2        10/01/2020  0

Last rows

Row  MES  SEM  FECHA       BEB_REF  CAR_PAQ  CAR_PKL   COMP_ENV  COMP_PAQ  LACT_Q  TOR_TOR  VENTA        COSTO         UTIL          DESC    MES_CFD  SEM_CFD  F_CFD       C_CFD
255  9    37   12/09/2020  11.0     12.0     61.76325  37.0      NaN       NaN     22.0     18945.47535  8194.100500   10751.374850  0.000   9        37       12/09/2020  47
256  9    37   13/09/2020  22.0     29.0     78.45698  78.5      2.0       2.0     38.0     31952.78762  15027.916600  16934.871020  43.040  9        37       13/09/2020  47
257  9    38   14/09/2020  3.0      7.0      16.62500  8.5       NaN       NaN     5.0      5971.87500   2922.114000   3049.761000   0.000   9        38       14/09/2020  104
258  9    38   15/09/2020  4.0      3.0      24.07459  21.0      NaN       NaN     12.0     7154.61471   3470.702050   3683.912660   0.000   9        38       15/09/2020  65
259  9    38   16/09/2020  1.0      21.0     40.68531  28.0      2.0       NaN     20.0     17131.68559  7924.484335   9217.201255   0.000   9        38       16/09/2020  69
260  9    38   17/09/2020  0.0      NaN      1.80000   10.0      NaN       NaN     4.0      953.20000    499.000000    464.200000    0.000   9        38       17/09/2020  87
261  9    38   18/09/2020  0.0      4.0      12.00251  12.5      NaN       NaN     15.0     4618.49949   2265.051200   2353.448290   0.000   9        38       18/09/2020  77
262  9    38   19/09/2020  2.0      5.0      7.12143   36.5      NaN       3.0     30.0     4544.49937   2932.861320   1621.638050   0.000   9        38       19/09/2020  56
263  9    38   20/09/2020  4.0      2.0      15.12000  44.5      NaN       2.0     26.0     5894.99500   3289.039500   2605.955500   17.485  9        38       20/09/2020  20
264  9    39   21/09/2020  0.0      2.0      7.51068   12.5      NaN       NaN     2.0      2478.99862   1430.160962   1148.837658   0.000   9        39       21/09/2020  54